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Quasi-randomness is determined by the distribution of copies of a 
fixed grapfi in equicardinal large sets 
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Abstract 



For every fixed graph H and every fixed < a < 1, we sliow tliat if a graph G has the 
, property that all subsets of size an contain the "correct" number of copies of H one would 

I expect to find in the random graph G{n,p) then G behaves like the random graph G{n,p)] that 

is, it is p-quasi-random in the sense of Chung, Graham, and Wilson [3]. This solves a conjecture 
raised by Shapira [8] and solves in a strong sense an open problem of Simonovits and Sos [9] . 



1 Introduction 



graphs are such that any graph that satisfies them, resembles an appropriate random graph (namely, 
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^T) ' The theory of quasi-random graphs asks the following fundamental question: which properties of 

in 

I the graph satisfies the properties that a random graph would satisfy, with high probability). Such 

I properties are called quasi-random. 
QQ I The theory of quasi-random graphs was initiated by Thomason |10[ [TT] and then followed by 



Chung, Graham and Wilson who proved the fundamental theorem of quasi-random graphs [1]. 
Since then there have been many papers on this subject (see, e.g. the excellent survey [6J). Quasi- 
^ I random properties were also studied for other combinatorial structures such as set systems [1], 

5^ \ tournaments [2], and hyper graphs [3j. There are also some very recent results on quasi-random 

groups [5j and generalized quasi-random graphs [7]. 

In order to formally define p-quasi-randomness we need to state the fundamental theorem of 
quasi-random graphs. As usual, a labeled copy of a graph H in a graph G is an injective mapping (p 
from V{H) to V{G) that maps edges to edges. That is {x,y) G E{H) implies {(j){x) , (j){y)) E E{G). 
For a set of vertices U C y{G) we denote by H\U] the number of labeled copies of H in the 
subgraph of G induced by U and by e{U) the number of edges of G with both endpoints in U . A 
graph sequence (G^) is an infinite sequence of graphs {Gi, G2, . . .} where Gn has n vertices. The 
following result of Chung, Graham, and Wilson [IJ shows that many properties of different nature 



*Department of Mathematics, University of Haifa, Haifa 3f905, Israel. E-mail: raphy@math.haifa.ac.il 



are equivalent to the notion of quasi-randomness, defined using edge distribution. The original 
theorem lists seven such equivalent properties, but we only state four of them here. 

Theorem 1 (Chung, Graham, and Wilson [4|) Fix any 1 < p < 1. For any graph sequence 
{Gn) the following properties are equivalent: 

Vi{t): For an even integer t > 4, let Ct denote the cycle of length t. Then e{Gn) = |pn^ + o(n^) 
and Ct[Gn] = p^n* + o(n*). 

V2: For any subset of vertices U C V{Gn) we have e{U) = + o(n^). 

Vs-' For any subset of vertices U C V{Gn) of size n/2 we have e{U) = ^p|t/p + o(n^). 

Vi{a): Fix an a ^ {0, For any U <^ V{Gn) of size an we have e{U,V\U) = pa{\ — a)n^ + o{n'^). 

The formal meaning of the properties being equivalent is expressed, as usual, using e, 5 notation. 
For example the meaning that implies V2 is that for any e > there exist 5 = 6{e) and = A^(e) 
so that for all n > A^, if G is a graph with n vertices having the property that any subset of vertices 
U of size n/2 satisfies \e{U) — < 6n^ then also for any subset of vertices W we have 

\e{W) - \p\W\^\ < en2. 

Given Theorem [1] we say that a graph property is p-quasi-random if it is equivalent to any (and 
therefore all) of the four properties defined in that theorem. (We will usually just say quasi-random 
instead of p-quasi-random since p is fixed throughout the proofs). Note, that each of the four 
properties in Theorem [1] is a property we would expect G{n,p) to satisfy with high probability. 

It is far from true, however, that any property that almost surely holds for G{n,p) is quasi- 
random. For example, it is easy to see that having vertex degrees np(l-|-o(l)) is not a quasi-random 
property (just take vertex-disjoint cliques of size roughly np each). An important family of non 
quasi-random properties are those requiring the graphs in the sequence to have the correct number 
of copies of a fixed graph H. Note that Vi{t) guarantees that for any even t, if a graph sequence has 
the correct number of edges as well as the correct number of copies oi H = Ct then the sequence is 
quasi-random. As observed in fIJ this is not true for all graphs. In fact, already for H = there 
are simple constructions showing that this is not true. 

Simonovits and Sos observed that the standard counter-examples showing that for some graphs 
H, having the correct number of copies of H is not enough to guarantee quasi-randomness, have the 
property that the number of copies of H in some of the induced subgraphs of these counter-examples 
deviates significantly from what it should be. As quasi-randomness is a hereditary property, in 
the sense that we expect a sub-structure of a random-like object to be random-like as well, they 
introduced the following variant of property Vi of Theorem [H where now we require all subsets of 
vertices to contains the "correct" number of copies of H. 
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Definition 1.1 (Vh) For a fixed graph H with h vertices and r edges, we say that a graph sequence 
(Gn) satisfies Vh if all subsets of vertices U C V{Gn) with \U\ = an satisfy H[U] = p^\U\^ + o{n^). 

As opposed to Vi, which is quasi-random only for even cycles, Simonovits and Sos [9] showed 
that Vh is quasi-random for any nonempty graph H. 

Theorem 2 For any fixed H that has edges, property Vh is quasi-random. 

We can view property Vh as a generalization of property V2 in Theorem [H since V2 is just the 
special case Vk2- Now, property V3 in Theorem [1] guarantees that in order to infer that a sequence 
is quasi-random, and thus satisfies V2, it is enough to require only the sets of vertices of size n/2 
to contain the correct number of edges. An open problem raised by Simonovits and Sos [9], and in 
a stronger form by Shapira [S], is that the analogous condition also holds for any H. Namely, in 
order to infer that a sequence is quasi-random, and thus satisfies Vh, it is enough, say, to require 
only the sets of vertices of size n/2 to contain the correct number of copies of H. Shapira [8] proved 
that is it enough to consider sets of vertices of size n/{h + 1). Hence, in his result, the cardinality 
of the sets depends on h. Thus, if H has 1000 vertices, Shpiras's result shows that it suffices to 
check vertex subsets having a fraction smaller than 1/1000 of the total number of vertices. His 
proof method cannot be extended to obtain the same result for fractions larger than l/(h + e). 

In this paper we settle the above mentioned open problem completely. In fact, we show that 
for any H, not only is it enough to check only subsets of size n/2, but, more generally, we show 
that it is enough to check subsets of size an for any fixed a € (0, 1). More formally, we define: 

Definition 1.2 {Vh,q) For a fixed graph H with h vertices and r edges and fixed < a < 1 we 
say that a graph sequence (Gn) satisfies VH,a if o-H subsets of vertices U C V{Gn) with \U\ = [an\ 
satisfy H[U] = p''\U\'' + o{n^). 

Our main result is, therefore: 

Theorem 3 For any fixed graph H and any fixed < a < 1, property VH,a is quasi-random. 

2 Proof of the main result 

For the remainder of this section let H he a fixed graph with h > 1 vertices and r > edges, and let 
a £ (0, 1) be fixed. Throughout this section we ignore rounding issues and, in particular, assume 
that an is an integer, as this has no effect on the asymptotic nature of the results. 

Suppose that the graph sequence (Gn) satisfies VH,a- We will prove that it is quasi-random by 
showing that it also satisfies Vh- In other words, we need to prove the following lemma which, 
together with Theorem [2l yields Theorem [3l 
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Lemma 2.1 For any e > there exists N = N{e, h, a) and 5 = (5(e, h, a) so that for all n> N , if G 
is a graph with n vertices satisfying that for all U C V{G) with \U\ = an we have \H[U] —p'^\U\^'\ < 
6n^ then G also satisfies that for all W C V{G) we have \H\W] —p'^\W\^\ < en^. 

Proof: Suppose therefore that e > is given. Let N = N{e, h, a), e' = e'(e, h, a) and 6 = 6{e, h, a) 
be parameters to be chosen so that N is sufficiently large and 5 <C e' are both sufficiently small to 
satisfy the inequalities that will follow, and it will be clear that they are indeed only functions of 
e, h, and a. 

Now, let G be a graph with n > N vertices satisfying that for all U C V{G) with |?7| = 
an we have |-ff[f/] — < 6n^. Consider any subset W C V{G). We need to prove that 

\H[W] -pH^I'^I < en''. 

For convenience, set k = an. Let us first prove this for the case where \W\ = m > k. This case 
can rather easily be proved via a simple counting argument. Denote by U the set of (™) /c-subsets 
of W. Hence, by the given condition on fc-subsets, 

(^^ ifk'^ - 5n'^) < X: H[U] < (^^ {p'k'^ + dn'^) . (1) 
Every copy of if in appears in precisely {^Zh) distinct U & it follows from ([1]) that 

Hm = 7;^ E H[U] < -^yk'^ + Sn'^) < fm'^ + -n^ , (2) 
\k-h) U&A \k-h) 



and similarly from ([T]) 

Hm = j^J: H[U] > j^Afk'^ - Sn'^) > fm'^ - -n^ . (3) 
\k-h) U&A \k-h) 

We now consider the case where = m = (5n < an = k. Notice that we can assume that 
P > e since otherwise the result is trivially true. The set ?i of //-subgraphs of G can be partitioned 
into h + 1 types, according to the number of vertices they have in W. Hence, for j = 0, . . . ,h let 
TCj be the set of //-subgraphs of G that contain precisely j vertices in 1/ \ W. Notice that, by 
definition, \7io\ = H[W]. For convenience, denote wj = \'Hj\/n'^. We therefore have 

1^1 H[V] , 

WQ + wi^ ywh = —r = — 7— = P + fJ- (4) 

n n 

where < e'/2. 

Define A = and set ki = k + iXn for i = 1, . . . ,h. Let Yi C V \ W he a random set of 

ki — m vertices, chosen uniformly at random from all (^,~ m) subsets of size ki — mofV\W. Denote 
Ki = Yi U W and notice that \Ki\ = ki > an. We will now estimate the number of elements of Tij 
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that "survive" in Ki. Formally, let Tij^i be the set of elements of Tij that have all of their vertices 
in Ki, and let ruj^i = Clearly, mo,j = since W d Ki. Furthermore, by ([2]) and dS]), 

mo,i + mi,j + • • • + nih^i = H[Ki] = p'^k'l + piu'' (5) 

where pi is a random variable with \pi\ < e'/2. 

For an i/-copy T € TCj we compute the probability pj^i that T € //[ii'j]. Since T € H[Ki] if and 
only if all the j vertices of T in y \ 1^ appear in Yi we have 

in—m~j\ . , , • 1 \ 

_ Kki-m-j) _ [ki - m) ■ ■ ■ [ki - m - J + 1) 

Pj,i 



(n - m) • • • (n - m - j + 1) 

Defining Xi = {ki — m) /{n — m) and noticing that 

ki — m a — 13 A 
* n — m 1 — /3 1 — /3 

it follows that 



Pj,i - 4 



Clearly, the expectation of mj^j is E[mj^j] = Pj^i\H.j\. By linearity of expectation we have from 
^ that 

E[mo,i] + E[mi,i] + • • • + E[m;,,i] = ^{ii\K,\\ = p^'k'^ + E[pi]n\ 
Dividing the last equality by we obtain 

Po,i'Wo H h Ph,iWh = p"" (a + Ai)'* + E[pi] . (7) 

By ([6]) and ([7]) we therefore have 

h 

^xjwj = p'^ {a + Xi)'^ + Pi (8) 

j=0 

where pi = E[pi] + Q and < e'/2. Since also \pi\ < e'/2 we have that \pi\ < e'. 

Now, @ and (l8|) form together a system of /i + 1 linear equations with the h + I vari- 
ables Wo, . . . ,Wh. The coefficient matrix of this system is just the Vandermonde matrix A = 
A{xi, . . . , Xfi, 1). Since xi, . . . , Xh, 1 are all distinct, and, in fact, the gap between any two of them 
is at least A/(l — /?) = (! — a)/((/i + 1)(1 — /?)) > (1 — a)/(/i + 1), we have that the system has 
a unique solution which is A~^b where b G R^'^^ is the column vector whose z'th coordinate is 
p"^ {a + Xi)^ + Pi for i = 1, . . . ,h and whose last coordinate is p^ + p. Consider now the vector b* 
which is the same as b, just without the piS. Namely b* € R^^'^ is the column vector whose z'th 
coordinate is p^ (a + Xi)^ for i = 1, . . . , /i and whose last coordinate is p"^ . Then the system A~^b* 
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also has a unique solution and, in fact, we know explicitly what this solution is. It is the vector 
w* = {wq, . . . , w'D where 

Wj = p 



Indeed, it is straightforward to verify the equality 



7=0 



J 

and, for alH = 1, . . . , the equalities 

E + T^p^yp' - = (« + ^^)'^ • 

Now, since the mapping F : R^~^^ R^^^ mapping a vector c to A~^c is continuous, we know that 
for e' sufficiently small, if each coordinate of c has absolute value less than e', then each coordinate 
of A~^c has absolute value at most e. Now, define c = b — b* = (/xi, . . . , ///j, //). Then we have that 
each coordinate Wi of A~^b differs from the corresponding coordinate of A~^b* by at most e. In 
particular, 

\wo — Wq \ = \wo — p^P'^\ < e. 

Hence, 

-nV/3''| = \H[W] -pH^h < e"'' 
as required. ■ 
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